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ABSTRACT 

This paper proposes an analytic model for dimensioning 
OFDMA based networks like WiMAX and LTE systems. 
In such a system, users require a number of subchannels 
which depends on their SNR, hence of their position and 
the shadowing they experience. The system is overloaded 
when the number of required subchannels is greater than the 
number of available subchannels. We give an exact though 
not closed expression of the loss probability and then give 
an algorithmic method to derive the number of subchan- 
nels which guarantees a loss probability less than a given 
threshold. We show that Gaussian approximation lead to 
optimistic values and are thus unusable. We then introduce 
Edgeworth expansions with error bounds and show that by 
choosing the right order of the expansion, one can have an 
approximate dimensioning value easy to compute but with 
guaranteed performance. As the values obtained are highly 
dependent from the parameters of the system, which turned 
to be rather undetermined, we provide a procedure based 
on concentration inequality for Poisson functionals, which 
yields to conservative dimensioning. This paper relies on re- 
cent results on concentration inequalities and establish new 
results on Edgeworth expansions. 
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1. INTRODUCTION 

Future wireless systems will widely rely on OFDMA (Or- 
thogonal Frequency Division Multiple Access) multiple ac- 
cess technique. OFDMA can satisfy end user's demands in 
terms of throughput. It also fulfills operator's requirements 
in terms of capacity for high data rate services. Systems 
such as 802. 16e and 3G-LTE (Third Generation Long Term 
Evolution) already use OFDMA on the downlink. Dimen- 
sioning of OFDMA systems is then of the utmost importance 
for wireless telecommunications industry. 



OFDM (Orthogonal Frequency Division Multiplex) is a multi 
carrier technique especially designed for high data rate ser- 
vices. It divides the spectrum in a large number of frequency 
bands called (orthogonal) subcarriers that overlap partially 
in order to reduce spectrum occupation. Each subcarrier has 
a small bandwidth compared to the coherence bandwidth of 
the channel in order to mitigate frequency selective fading. 
User data is then transmitted in parallel on each sub car- 
rier. In OFDM systems, all available subcarriers are affected 
to one user at a given time for transmission. OFDMA ex- 
tends OFDM by making it possible to share dynamically 
the available subcarriers between different users. In that 
sense, it can then be seen as multiple access technique that 
both combines FDMA and TDMA features. OFDMA can 
also be possibly combined with multiple antenna (MIMO) 
technology to improve either quality or capacity of systems. 




Figure 1: OFDMA principle : subcarriers are allo- 
cated according to the required transmission rate 

In practical systems, such as WiMAX or 3G-LTE, subcarri- 
ers are not allocated individually for implementation reasons 
mainly inherent to the scheduler design and physical layer 
signaling. Several subcarriers are then grouped in subchan- 
nels according to different strategies specific to each system. 
In OFDMA systems, the unit of resource allocation is mainly 
the subchannels. The number of subchannels required by a 
user depends on his channel's quality and the required bit 
rate. If the number of demanded subchannels by all users 
in the cell is greater than the available number of subchan- 
nel, the system is overloaded and suffer packet losses. The 
questions addressed here can then be stated as follows: how 
many subchannels must be assigned to a BS to ensure a 



small overloading probability ? Given the number of avail- 
able subchannels, what is the maximum load, in terms of 
mean number of customers per unit of surface, that can be 
tolerated ? Both questions rely on accurate estimations of 
the loss probability. 

The objectives of this paper are twofold: First, construct 
and analyze a general performance model for an isolated 
cell equipped with an OFDMA system as described above. 
We allows several classes of customers distinguished by their 
transmission rate and we take into account path-loss with 
shadowing. We then show that for a Poissonian configura- 
tion of users in the cell, the required number subchannels 
follows a compound Poisson distribution. The second ob- 
jective is to compare different numerical methods to solve 
the dimensioning problem. In fact, there exists an algorith- 
mic approach which gives the exact result potentially with 
huge memory consumption. On the other hand, we use and 
even extend some recent results on functional inequalities 
for Poisson processes to derive some approximations formu- 
las which turn to be rather effective at a very low cost. 
When it comes to evaluate the performance of a network, 
the quality of such a work may be judged according to sev- 
eral criteria. First and foremost, the exactness is the most 
used criterion: it means that given the exact values of the 
parameters, the real system, the performances of which may 
be estimated by simulation, behaves as close as possible to 
the computed behavior. The sources of errors are of three 
kinds: The mathematical model may be too rough to take 
into account important phenomena which alter the perfor- 
mances of the system, this is known as the epistemic risk. 
Another source may be in the mathematical resolution of 
the model where we may be forced to use approximate al- 
gorithms to find some numerical values. The third source 
lies in the lack of precision in the determination of the pa- 
rameters characterizing the system: They may be hard, if 
not impossible, to measure with the desired accuracy. It is 
thus our point of view that exactness of performance analy- 
sis is not all the matter of the problem, we must also be able 
to provide confidence intervals and robust analysis. That is 
why, we insist on error bounds in our approximations. 

Resources allocation on OFDMA systems have been exten- 
sively studied over the last decade, often with joint power 
and subcarriers allocation, see for instance [1, 8, 14, 15]. 
The problem of OFDMA planning and dimensioning have 
been more recently under investigation. In [7], the authors 
propose a dimensioning of OFDMA systems focusing on 
link outage but not on the other parameters of the sys- 
tems. In [11], the authors give a general methodology for 
the dimensioning of OFDMA systems, which mixes a simu- 
lation based determination of the distribution of the signal- 
to-interference-plus-noise ratio (SINR) and a Markov chain 
analysis of the traffic. In [3, 9], the authors propose a di- 
mensioning method for OFDMA systems using Erlang's loss 
model and Kaufman-Roberts recursion algorithm. In [4], 
the authors study the effect of Rayleigh fading on the per- 
formance of OFDMA networks. 

The article is organized as follows. In Section 2, we describe 
the system model and set up the problem. In Section 3, 
we examine four methods to derive an exact, approximate 
or robust value of the number of subchannels necessary to 



ensure a given loss probability. In Section 4, we apply these 
formulas to the particular situation of OFDMA systems. A 
new bound for the Edgeworth expansion is in Section B and 
Section C contains a new proof of the concentration inequal- 
ity established for instance in [16]. 

2. SYSTEM MODEL 

In practical systems, such as WiMAX or 3G-LTE, resource 
allocation algorithms work at subchannel level. The subcar- 
riers are grouped into subchannels that the system allocates 
to different users according to their throughput demand and 
mobility pattern. For example, in WiMAX, there are three 
modes available for building subchannels: FUSC (Fully Par- 
tial Usage of Subchannels), PUSC (Partial Usage of Sub- 
Channels) and AMC (Adaptive modulation and coding). In 
FUSC, subchannels are made of subcarriers spread over all 
the frequency band. This mode is generally more adapted 
to mobile users. In AMC, the subcarriers of a subchannel 
are adjacent instead of being uniformly distributed over the 
spectrum. AMC is more adapted to nomadic or stationary 
users and generally provides higher capacity. 

The grouping of subcarriers into subchannels raises the prob- 
lem of the estimation of the quality of a subchannel. The- 
oretically channel quality should be evaluated on each sub- 
carrier of the corresponding subchannel to compute the as- 
sociated capacity. This work assumes that it is possible to 
consider a single channel gain for all the subcarriers making 
part of a subchannel (for example via channel gains evalu- 
ated on pilot subcarriers). 

We consider a circular cell C of radius R with a base sta- 
tion (BS for short) at its center. The transmission power 
dedicated to each subchannel by the base station is denoted 
by P. Each subchannel has a total bandwidth W (in kHz). 
The received signal power for a mobile station at distance d 
from the BS can be expressed as 

PK 

P(d) = L^GF := P,Gd-\ (1) 

where if 7 is a constant equal to the attenuation at a refer- 
ence distance, denoted by d TC f, that separates far field from 
near field propagation. Namely, 

where / is the radio-wave frequency. The variable 7 is the 
path-loss exponent which indicates the power at which the 
path loss increases with distance. Its depends on the spe- 
cific propagation environment, in urban area, it is in the 
range from 3 to 5. It must be noted that this propagation 
model is an approximate model, difficult to calibrate for real 
life situations. In particular, it might be reasonable to en- 
vision models where 7 depends on the distance so that the 
attenuation would be proportional to d l( - d \ Because of the 
complexity of such a model, 7 is often considered as constant 
but the path-loss is multiplied by two random variables G 
and F which represent respectively the shadowing, i.e. the 
attenuation due to obstacles, and the Rayleigh fading, i.e. 
the attenuation due to local movements of the mobile. Usu- 
ally, G is taken as a log-normal distribution: G = 10 S//1 °, 
where S ~ AT(k, v 2 ). As to F, it is customary to choose an 



exponential distribution with parameter 1. Both, the shad- 
owing and the fading experienced by each user are supposed 
to be independent from other users' shadowing and fading. 
For the sake of simplicity, we will here treat the situation 
where only shadowing is taken into account, the computa- 
tions would be pretty much like the forthcoming ones and 
the results rather similar should we consider Rayleigh fad- 
ing. 

All active users in the cell compete to have access to some of 
the iV a vaii available subchannels. There are K classes of users 
distinguished by the transmission rate they require: Ck is 
the rate of class k customers and Tk denotes the probability 
that a customer belongs to class k. A user, at distance d 
from the BS, is able to receive the signal only if the signal- 
to-interference-plus-noise ratio SNR = is above some 
constant /3 m i n where / is the noise plus interference power 
and P(d) is the received signal power at distance d, see (1). If 
the SNR is below the critical threshold, then the user is said 
to be in outage and cannot proceed with his communication. 

To avoid excess demands, the operator may impose a max- 
imum number Af max of allocated subchannels to each user 
at each time slot. According to the Shannon formula, for a 
user demanding a service of bit rate Ck, located at distance 
d from the BS and experiencing a shadowing g, the number 
of requires subchannels is thus the minimum of iV max and of 



set 



Ck 



w\o g2 (i + p l9 d-yi) 



if Pjgd ~< /I > fimin, 

otherwise, 



where \x\ means the minimum integer number not smaller 
than x. 

We make the simplifying assumption that the allocation is 
made at every time slot and that there is no buffering neither 
in the access point nor in each mobile station. All the users 
have independently from others a probability p to have a 
packet to transmit at each slot. This means, that each user 
has a traffic pattern which follows a geometric process of 
intensity p. We also assume that users are dispatched in the 
cell according to a Poisson process of intensity Ao . According 
to the thinning theorem for Poisson processes, this induces 
that active users form a Poisson process of intensity A = 
Aop. This intensity is kept fixed over the time. That may 
result from two hypothesis: Either we consider that for a 
small time scale, users do not move significantly and thus 
the configuration does not evolve. Alternatively, we may 
consider that statistically, the whole configuration of active 
users has reached its equilibrium so that the distribution of 
active users does not vary through time though each user 
may move. 

From the previous considerations, a user is characterized by 
three independent parameters: his position, his class and 
the intensity of the shadowing he is experiencing. We model 
this as a Poisson process on E — 5(0, R) x {1, • • • , K} x R + 
of intensity measure 

A dv(x) := A( da: ® dr(fe) <g) dp(g)) 

where B(0, R) = {x € R 2 , < R}, r is the probability 
distribution of classes given by r({fc}) = r fe and p is the 
distribution of the random variable G defined above. We 



f(x, k, g) = min (A" ma x 
1 {P- t g\W\-<>if3 m ,r l } 



Ck 



W\og 2 (l + P^g\\x\\-yi) 
With the notations of Section B, 

iVtot = / f(x, k, g) dui(x, k, g). 

./cell 

We are interested in the loss probability which is given by 

P(iVtot > iVavail). 

We first need to compute the different moment of / with 
respect to v in order to apply Theorem 2 and Theorem 3. 
For, we set 



h = N n 



A 



Ck 



VRog 2 (l + /3 min ) 
where a A b — min(a, b). Furthermore, we introduce fik, o 

CO, 

Pk.i = Ji (f* /wl - l) , 1 < k < K, 1 < / < l k - 1, 
and p k ,i k = If3 m in/P- 

By the very definition of the ceiling function, we have 



i E 



K l k 



= 1t, Tk i>2 lP [ [ Muk.i-.Pk.i^iaM 7 ) dp( 5 ) dx. 

k=l 1=1 JceUJR 

According to the change of variable formula, we have 
/ 1 W k , l ,h,i-i)(9\\x\r' < ) dx 

J cell 



= 7 r(^Ai? 2 -/3- 2 /_>i?V /7 . 
Thus, we have 

(g\\x\D dp(g) dx 

./cell J R 

= A R 2 - P~ k y_\ A R 2 )E [lO s ^<] 

= 7r(/3- 2/ ^ A R 2 - p~ 2 /_\ A R 2 ) 1 < K +W lnl °)/^ : = Cm . 
We thus have proved the following theorem. 
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Theorem 1. For any p > 0, with the same notations as 
above, we have: 



f f dv = j2 T «lL lP ^ 



(2) 



3. LOSS PROBABILITY 
3.1 Exact method 

Since / is deterministic, At ot follows a compound Poisson 
distribution: it is distributed as 



where (Nk,i, 1 < k < K, 1 < I < lk) are independent Pois- 
son random variables, the parameter of Nkj is \TkCk,l- Us- 
ing the properties of Poisson random variables, we can re- 
duce the complexity of this expression. Let L — max(lk, 1 < 
k<K) and for / € {1, • • • , L}, let K t = {k, h > I}. Then, 
A/tot is distributed as 

L 

i=i 

where (Mi, 1 < I < l k ) are independent Poisson random 
variables, the parameter of Mi being m; := ~}2 keKl ArfcCfc.i- 
For each I, it is easy to construct an array which represents 
the distribution of IMi by the following rule: 



Pi(w) 



if w mod I =fc 0, 

exp(— mi)m1/q\ iiw = ql. 



By discrete convolution, the distribution of N to t and then its 
cumulative distribution function, are easily calculable. The 
value of -/V ava ii which ensures a loss probability below the 
desired threshold is found by inspection. The only difficulty 
with this approach is to determine where to truncate the 
Poisson distribution functions for machine representation. 
According to large deviation theory [6], 

P(Poisson(6>) > a8) < exp(-6>(alna + 1 - a)). 

When 9 is known, it is straightforward to choose a(6) so 
that the right-hand-side of the previous equation is smaller 
than the desired threshold. The total memory size is thus 
proportional to max(m(a(m;)i, 1 < I < It)- This may be 
memory (and time) consuming if the parameters of some 
Poisson random variables or the threshold are small. This 
method is well suited to estimate loss probability since it 
gives exact results within a reasonable amount of time but 
it is less useful for dimensioning purpose. Given iV ava ii, if we 
seek for the value of A which guarantees a loss probability 
less than the desired threshold, there is no better way than 
trial and error. At least, the subsequent methods even im- 
precise may help to evaluate the order of magnitude of A for 
the first trial. 



3.2 Approximations 

We begin by the classical Gaussian approximation. It is 
clear that 



>([ / do; > A/ avail ) = P( / /„( dw- A di/) > N a ) 

J E J E 



where N a — (iV avai i — J /A dv)/a. Since the indicator func- 
tion l[Nv, +oo) is not Lipschitz, we can not apply the bound 
given by Theorem 2. However, we can upper-bound the in- 
dicator by a continuous function whose Lipschitz norm is 
not greater than 1. For instance, taking 



<t>(x) = min(a; + , 1) and <Pn(x) = <f>(x — N), 



Hence, 

<P(/ / d^ > A/ avail ) < 

J E 

i-W-D + Wf^, (3) 

where Q is the cumulative distribution function of a stan- 
dard Gaussian random variable. 

According to Theorem 3, one can proceed with a more accu- 
rate approximation. Via polynomial interpolation, it is easy 
to construct a C 3 function ip N such that 

||(V'iv) (3) ||oo < 1 and l[jv a+3 .5, +00 ) < 4> l Na < 
and a function %p r N such that 

||(V>Ar) (3) ||°° < 1 an d l[jv ff ,+oo) < Vn„ < l[jv a -s.5, +oo) 
From (10), it follows that 

l-Q(N a + 3.5) - m( W 3) (N a + 3.5) - E x 



6^ 

<P(/ / dco > A/ avail ) < 

J E 

1 - Q(N a - 3.5) + m(3 L L) Q^(N a - 3.5) + E x (4) 
6v A 

where E x is the right-hand-side of (13) with ||^ (3) = 1. 

Going again one step further, following the same lines, ac- 
cording to (15), one can show that 



P( [ f du>> Af avail ) < 1 - Q{N a - 6.5) 

J E 



+ ^jS Q ^(N a 6.5) + H%±Iq^(N„ - 6.5) 
6V A ' 2A 

+ r^ Q(3)(iVCT _ 65) + FA (5) 
where F\ is bounded above in (16). 

For all the approximations given above, for a fixed value of 
A/avaii, an approximate value of A can be obtained by solving 
numerically an equation in \f\. 

3.3 Robust upper-bound 

If we seek for robustness and not precision, it may be inter- 
esting to consider the so-called concentration inequality. We 
remark that in the present context, / is non-negative and 
bounded by L = max fe l k so that we are in position to apply 
Theorem 4. We obtain that 



P(/ fduj> [ fdu + a) 

J E J E 



we have 



< exp — 



$ E f\dv ah 



g{ 



V yy f E f 2 \dv 



)), (6) 



l[JV„+l,+oo) < <t>N a +l < l[JV„,+oo) < <t>N„-l < l[jv a -l,+oo)- 



where g is defined in Section C. 



4. APPLICATIONS TO OFDMA AND LTE 

In such systems, there is a huge number of physical param- 
eters with a wide range of variations, it is thus rather hard 
to explore the while variety of sensible scenarios. For illus- 
tration purposes, we chose a circular cell of radius R = 300 
meters equipped with an isotropic antenna such that the 
transmitted power is 1 W and the reference distance is 10 
meters. The mean number of active customers per unit of 
surface, denoted by A, was chosen to vary between 0, 001 
and 0.000 1, this corresponds to an average number of ac- 
tive customers varying from 3 to 30, a realistic value for 
the systems under consideration. The minimum SINR is 
0.3 dB and the random variable S defined above is a cen- 
tered Gaussian with variance equal to 10. There are two 
classes of customers, Ci = 1,000 kb/s and C2 = 400 kb/s. 
It must be noted that our set of parameters is not universal 
but for the different scenarios we tested, the numerical facts 
we want to point out were always apparent. Since the time 
scale is of the order of a packet transmission time, the traf- 
fic is defined as the mean number of required subchannels 
at each slot provided that the time unit is the slot duration, 
that is to say that the load is defined as p = A J f Ais. 



1 c - 7MI 
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Figure 2: Impact of 7 and r on the loss probability 

(iVavail =92, A = 0.0001) 

Figure 2 shows, the loss probability may vary up to two 
orders of magnitude when the rate and the probability of 
each class change even if the mean rate J^, TtCk remains 
constant. Thus mean rate is not a sufficient parameter to 
predict the performances of such a system. The load p is 
neither a pertinent indicator as the computations show that 
the loads of the various scenarios differs from less than 3%. 

Comparatively, Figure 2 shows that variations of 7 have 
tremendous effects on the loss probability: a change of a 
few percents of the value of 7 induces a variation of several 
order of magnitude for the loss probability. It is not sur- 
prising that the loss probability increases as a function of 7: 
as 7 increases, the radio propagation conditions worsen and 
for a given transmission rate, the number of necessary sub- 
channels increases, generating overloading. Beyond a certain 
value of 7 (apparently around 3.95 on Figure 2), the radio 
conditions are so harsh that a major part of the customers 
are in outage since they do not satisfy the SNR criterion any 



longer. We remark here that the critical value of 7 is almost 
the same for all configurations of classes. Indeed, the criti- 
cal value 7 C of 7 can be found by a simple reasoning: When 
7 < 7c a class k customer uses less than the allowed Ik sub- 
channels because the radio conditions are good enough for 
fi]J~j > R f° r some j < Ik so that the load increases with 7. 

For 7 > 7c, all the /J^ 1 / 7 are l° wer than R and the larger 7, 
the wider the gap. Hence the number of customers in outage 
increases as 7 increases and the load decreases. Thus, 



7c 



inf{7,/3 s Y\ < R} for s = arg ma,x k l k - 



If we proceed this way for the data of Figure 2, we retrieve 
7c ~ 3.95. This means that for a conservative dimensioning, 
in the absence of estimate of 7, computations may be done 
with this value of 7. 

For a threshold given by e = 10 -4 , we want to find iV ava ii 
such that P(A?tc.t > A?" ava ii) < e. As said earlier, the exact 
method gives the result at the price of a sometimes lengthy 
process. In view of 3, one could also search for a such that 



- Q(a) + 



m(3, A) 



(7) 



and then consider [1 + f E fdv + aa] as an approximate 
value of 7V ava ii . Unfortunately and as was expected since the 
Gaussian approximation is likely to be valid for large values 
of A, the corrective term in (7) is far too large (between 30 
and 500 depending on 7) for (7) to have a meaning. Hence, 
we must proceed as usual and find a such that 1 — Q(a) = e, 
i.e. a ~ 3.71. The approximate value of -/V ava n is thus given 
by [ J E f Av + 3.71a] . The consequence is that we do not 
have any longer any guarantee on the quality of this ap- 
proximation, how close it is to the true value and even more 
basic, whether it is greater or lower than the correct value. 
In fact, it is absolutely impossible to choose a dimension- 
ing value lower than the true value since there is no longer 
a guarantee that the loss probability is lower than e. As 
shows Figure 3, it turns out that the values returned by the 
Gaussian method are always under the true value. Thus this 
annihilates any possibility to use the Gaussian approxima- 
tion for dimensioning purposes. 

Going one step further, according to (4), one may find a 
such that 



Q(a) 



ra(3, A) ^ (3) 



Q w (a) + E, 



and then use 



[3.5 + / / Av + aa] 

JE 

as an approximate guaranteed value of iV ava ii. By guaran- 
teed, we mean that according to (4), it holds for sure that 
the loss probability with this value of 7V ava ii is smaller than e 
even if there is an approximation process during its compu- 
tation. Since the error in the Edgeworth approximation is of 
the order of 1/A, instead of 1/%/A for the Gaussian approxi- 
mation, one may hope that this method will be efficient for 
smaller values of A. It turns out that for the data sets we 
examined, Ex is of the order of 10~ 7 /A, thus this method 
can be used as long as 10~ 7 /A <C e. Otherwise, as for the 



Gaussian case, we are reduced to find a such that 

l_Q (Q) _!^Q(3) (a) = e 
6 

and consider [3.5 + f E fdv + aa] but we no longer have 
any guarantee on the validity of the value. As Figure 3 
shows, for the considered data set, Edgeworth methods leads 
to an optimistic value which is once again absolutely not 
acceptable. One can pursue the development as in (15) and 
use (5), thus we have to solve 



Q(a) _m(|A) Q( 3 )(Q) 



i(3,l) 
72A 



-Q 



(5), 



a) + 



m(4,l) 
24A 



l(3) 



(a) 



For the analog of 4 to hold, we have to find a C% func- 
tion greater than lt Xl0a ) but smaller than lr^-iag, oo) with 
a fifth derivative smaller than 1. Looking for $ in the set 
of polynomial functions, we can find such a function only if 
lag is greater than 6.5 (for smaller value of the lag, the fifth 
derivative is not bounded by 1) thus the dimensioning value 
has to be chosen as: 

[6.5 + f f dv + aa] . 

J E 

For the values we have, it turns out that F\ is of the order 
of 10" 9 A- 3/2 which is negligible compared to e = 10 , so 
that we can effectively use this method for A > 10 -4 . As it 
is shown in Figure 3, the values obtained with this develop- 
ment are very close to the true values but always greater as 
it is necessary for the guarantee. The procedure should thus 
be the following: compute the error bounds given by (3), 
(13) and (5) and find the one which gives a value negligible 
with respect to the threshold e, then use the corresponding 
dimensioning formula. If none is suitable, use a finer Edge- 
worth expansion or resort to the concentration inequality 
approach. 

Note that the Edgeworth method requires the computations 
of the first three (or five) moments, whose lengthiest part is 
to compute the i which is also a step required by the exact 
method. Thus Edgeworth methods are dramatically simpler 
than the exact method and may be as precise. However, 
both the exact and Edgeworth methods suffer from the same 
flaw: There are precise as long as the parameters, mainly A 
and 7, are perfectly well estimated. The value of 7 is often 
set empirically (to say the least) so that it seems important 
to have dimensioning values robust to some estimate errors. 
This is the goal of the last method we propose. 

According to (6), if we find a such that 

aL . log(e)L 2 



g{ 



and 



iVavail = J f dlS+j^ J f'X dV, 

we are sure that the loss probability will fall under e. How- 
ever, we do not know a priori how larger this value of iV ava ii 
than the true value. It turns out that the relative oversizing 
increases with 7 from a few percents to 40% for the large 
value of 7 and hence small values of A^vail- For instance, for 



(8) 



7 = 4.2, the value of AT avail given by (8) is 40 whereas the 
exact value is 32 hence an oversizing of 25%. However, for 
7 = 4.12, which is 2% away from 4.2, the required number of 
subchannels is also 40. The oversizing is thus not as bad as 
it may seem since it may be viewed as a protection against 
traffic increase, epistemic risk (model error) and estimate 
error. 
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APPENDIX 

A. HERMITE POLYNOMIALS 

Let $ be the Gaussian probability density function: <b(x) = 
cxp(— x 2 /2)/\/2tv and /i the Gaussian measure on R. Her- 
mite polynomials (Hk, k > 0) are defined by the recursion 
formula: 

H k (x)*(x) = £z*(x). 
For the sake of completness, we recall that 

Ho(x) = 1, Hi(x) = x, H2{x) — x 2 — 1, H-i{x) = x 3 — 3x 
H 4 (x) = x A - 6x 2 + 3, H 6 (x) = x 5 - 10x 3 + 15a-. 

Thus, for F 6 Cj, using integration by parts, we have 

/ F^(x)dfx(x)= f F(x)H k (x) d»(x). (9) 
Jr Jr 

Let Q(x) = /^^(u) du = J R x] (u)$(w) du. Then, 

Q' = $ and 



L(— oo; x] 



(u)H k (u) d/j,(u) 



d k+1 

l(-oo;«](ti)^iE+rQ(«) 



Q (k \x) = H k -i(x)®(x). (10) 



B. EDGEWORTH EXPANSION 



For details on Poisson processes, we refer to [2, 5]. For E a 
Polish space equipped with a Radon measure v, Ye denotes 
the set of locally finite discrete measures on E. The generic 
element u> of Ye may be identified with a set u> = {x n , n > 
1} such that uj n K has finite cardinal for any K compact 
in E. We denote by j E f du> the sum ~^2 xeuj f(x) provided 
that it exists as an element of RU {+00} . A Poisson process 
of intensity v is a probability P„ on Ye, such that for any 
/ G C K (E, R), 



E„ 



P(- / / dw) = exp(- / 1 

J E J E 



-fix) 



dv{x)). 



For / G L 1 ^), the Campbell formula states that 



/ 



fduj 



fdv. 



We introduce the discrete gradient D defined by 

D x F{u)) = F(uj U {x}) - F(u), for all x G E. 
In particular, for / G L 1 ^), we have 



D. 



f fdu> = f{x). 

J E 



The domain of D, denoted by Dom D is the set of function- 
als F : Ye — > R such that 



E„ 



\D x F{u)\ 2 du(x) 



< 00. 



The integration by parts then says that, for any F G Dom D, 
any u G E 2 (y), 



E„ 



1 / u(x){ du(x) — dv(x)) 

J E 

= E„ / D x Fu(x) dv(x) 

J E 



(11) 



We denote by a — ||/|| i 2( I/ ) VA and / CT = f/a. Note that 
II/ctIU^M = 1A and tnat 



i(p, A) := / |/„W| p Ad^) 

J E 



rj ( „ ) \\f\\U^ 1 - p/2 - 



The proof of the following theorem may be found in [5, 12, 
13]. 



Theorem 2. Let f G L 2 (y). For A > 0, let 



TV" 



Ie 



ftr(x)( doj(x) — A dv{x)). 



Then, for any Lipschitz function F from R to R, we have 
E A „ [F(iV A )] - J F dfj, 



~ \\\ m(3 ' A )II F II^' 



To prove the Edgeworth expansion and its error bound, 
we introduce some notions of Gaussian calculus. For F G 
C 2 (R; R), we consider 

AF(x) = xF'(x) - F"(x), for any x G R. 

The Ornstein-Uhlenbeck semi-group is defined by 

P t F(x) = / F(e^x + y/l - e- 2t y) dfx(y) for any t > 0. 
Jr 



The infinitesimal generator A and Pt are linked by the fol- 
lowing identity 

p poo 

F{x) - / F(y) d/i(y) = - / AP t F(x) dt. (12) 

J R i() 



Theorem 3. For F G C 3 (R, R), 

|e*„ [f(tv a )] -^(y) **(!/) 

-| m(3, A) J^F{y)H,{y) d^y) 

< ( m(3, l) 2 + m(4, 1) /j\ ]^%o . 
— I 6 9 \ ty I A 

Proof. According to the Taylor formula, 
D X G(N X ) = G{N X + U(x)) - G{N X ) 

= G'(N x )Mx) + ±f2(x)G"(N x ) 

+ \f„{xf jy 2 G (3) (rN X + (1 - r)U(x)) dr. (14) 
Hence, according to (11) and (14), 
E Xl/ [N x (P t F)'(N x )] 



— Ea„ 

= Ea„ 

1 

+ 



/ U(x)D x {P t F)'(N x )\ dv{x) 

J E 

\{P t F)"{N x )] 
\ fjl{x)\ dv(x)E x „ [(P t P) (3) (iV A )] 



+ \ J ft{x)\ dv(x 
r /•! 



[\p t F) {A) {rN x + (1 -r)f a (x))r 2 dr 
Jo 



= A 1 +A 2 + A 3 . 

It is well known that for F G C k , (x i-> P t P(a;)) is fc + 1- 
times differentiable and that we have two expressions of the 
derivatives (see [10]): 



(P t P) (fc+1) (x) 



-(k+l)t 



-21 



[ FMie-'x+y/T- 
Jr 



e' 2t y)y djj,(y). 



and (P t P) (fc+1) (a>) = e- (fe+1) *P t F (fc) (x). The former equa- 
tion induces that 



(PtF) 



(H-i)i 



-(fc+i)t 



< 



l|f (fc) ll=o / \y\ Mv) 

JR 



,-(fc+l)t 



- IIP 



Ml 



Hence, 



l^| < 



6\/r 



-24 V 



m(4, A) ||P 



(3) I 



Moreover, according to Theorem 2, 

|e a „ [(P t P) (3) (iV A )] - (P t F)< 3 »(x) d/*(:r)| 

< \\J\™^ A)||(P t P)( 4 )|U 
= ij|m(3, A)e- 3t ||(P t p( 3 >)'| 



<im(3,A)- ' 



=IIP< 3 >I 



Then, we have, 



\A 2 - ImC 



i(3, A) / (P t Ff a \x) dn(x)\ 
Jr 



Hence, 

E A „ [iV A (PiP)'(iV A ) - (P t P)"(iV A )] 



| F (3)| 



= im(3, A) f (P t F)W(x) d»(x) + R(t), 



where 



fl(t)<i m(3 : A)2 + m(4 : A \/^liip (3) ' 



vr 



Now then, 



/ (PtF)W(x) d^x) 
Jr 

= e- 3t [ [ F (3) (e- t x+ e- 2t y) dfi{y) 

Jr. Jr 

= e- 3t / F {3 \y) MV) 
Jr 

= e- m f F(y)H 3 (y) d^y), 
Jr 

since the Gaussian measure on R 2 is rotation invariant and 
according to (9). Remarking that 

poo 

/ e - 4t (l-e- 2t r 1/2 dt = 2/3 
Jo 

and applying (12) to x = N x , the result follows. □ 

This development is not new in itself but to the best of our 
knowledge, it is the first time that there is an estimate of the 
error bound. Following the same lines, we can pursue the 
expansion up to any order provided that P be sufficiently 
differentiable. Namely, for P G C£, we have 

Eai/ [F(N X )] = j F(y) Mv) 

+ i§r L Fi3){y) My) + L F{5,{y) My) 

+ L FiA){y) Mv) + fA\f^\\oo. (15) 



where 

„ , m(3,l) / 2 2 



4 n\ 12 



+ (l35 + 128V^ m(4 ' 1) l- (16) 



C. CONCENTRATION INEQUALITY 

We are now interested in an upper bound, which is called 
concentration inequality. 

Theorem 4. Let M,a > 0. Assume that \f(z)\ < M 
v—a.s and f G L 2 (E, v), then 

■■<'> -I* ,n) 

■where g(u) — (1 + u) ln(l + u) — u. 



The above theorem can be directly derived from [16]. How- 
ever let us take this opportunity to prove this theorem in a 
very nice, simple and elementary fashion, exactly the same 
way as Bennett built his concentration inequality for the 
sum of n i.i.d random variables. 

Proof. Using Chernoff's bound we have: 
P(F > E [F] + a) < E [e 9F ] / e e ^ +a) 

Now assume that \f(z)\ < M v— a.s . Observe that the 
function (e x — 1 — x)/x 2 is increasing on R (the value at 
is 1/2), we have that 

e 6f(z) - 1 < — 2 / (z) v a.s. 

Thus, 

P(F > E [F] + a) 

<ex P {/ ( f" ~V (*)) <M*)-*« 



e 9M -l-9M, 



= eXP \ V[F]-0a 

We find that = In (1 + aM/V [F]) /M minimizes the right- 
hand-side and thus we obtain (17). □ 



